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We give a derivation of general relativity (GR) and the gauge principle that is novel in pre- 
supposing neither spacetime nor the relativity principle. We consider a class of actions defined on 
superspace (the space of Riemannian 3-geometries on a given bare manifold) . It has two key proper- 
ties. The first is symmetry under 3-diffeomorphisms. This is the only postulated symmetry, and it 
leads to a constraint linear in the canonical momenta. The second property is that the Lagrangian 
is constructed from a 'local' square root of an expression quadratic in the velocities. The square root 
is 'local' because it is taken before integration over 3-space. It gives rise to quadratic constraints 
that do not correspond to any symmetry and are not, in general, propagated by the Euler-Lagrange 
equations. Therefore these actions are internally inconsistent. However, one action of this form is 
well behaved: the Baierlein-Sharp-Wheeler (BSW jl[) reparametrisation-invariant action for GR. 
From this viewpoint, spacetime symmetry is emergent. It appears as a 'hidden' symmetry in the 
(underdetermined) solutions of the Euler-Lagrange equations, without being manifestly coded into 
the action itself. In addition, propagation of the linear diffeomorphism constraint together with 
the quadratic square-root constraint acts as a striking selection mechanism beyond pure gravity. If 
a scalar field is included in the configuration space, it must have the same characteristic speed as 
gravity. Thus Einstein causality emerges. Finally, self-consistency requires that any 3- vector field 
must satisfy Einstein causality, the equivalence principle and, in addition, the Gauss constraint. 
Therefore we recover the standard (massless) Maxwell equations. 



Traditionally the 3-1-1 dynamical formulation of GR, with its associated Hamiltonian and Lagrangian structures, 
is obtained by projection from spacetime. Geometrodynamics has a four-dimensional symmetry from the start. In 
contrast, the classic study of Hojman, Kuchaf , and Teitelboim |^ starts with a three-dimensional fully constrained 
Hamiltonian and requires its constraint algebra to close by reproducing the standard Dirac algebra for GR. In this 
way, they recover the ADM Hamiltonian. However, by insisting on the specific structure 'constants' of the Dirac 
algebra, they still presuppose spacetime. In fact, this is unnecessary. We shall show that mere closure of the algebra 
is enough to obtain GR. 

We shall begin by explaining how the Baierlein-Sharp-Wheeler action, which is at the centre of our investiga- 
tion, arose from the ADM formalism Q. In the ADM approach, one starts from spacetime (a pseudo- Riemannian 
4- manifold). One observes that the 'surfaces' of constant label time are spacelike (Riemannian) 3-geometries. Ge- 
ometrodynamics is the evolution of these 3-geometries. Choosing 3-coordinates, one gets a smooth curve of 3-metrics, 
gij{x^,t), i,j = 1,2,3 with the coordinates. The 5ij's depend on the slice (labelled by t), on the coordinates and 
on the point on the slice. The coordinates can be transformed because only the 3-geometry matters, not the 3-metric. 
Moreover, the transformation can be changed freely from slice to slice as well as from point to point. 

The key concepts in the ADM formalism are the 3-metrics gij, the lapse N and the shift iV\ The lapse measures 
the rate of change of proper time w.r.t. the label time, while the shift determines how the coordinates are laid down 
on the successive 3-geometries. Prior to the transition to the Hamiltonian, the standard Hilbert-Einstein action for 
matter- free GR is rewritten, after divergence terms have been omitted, in the 3+1 form 



Here R is the three-dimensional scalar curvature, and Kij — — {1 /2N) {dgij / dt— Ni-j — Nj-i) is the extrinsic curvature. 
From here the transition made by BSW is trivial. They first replaced Kij in the action by kij = dgij / dt— Ni.j — Nj-i, 
the unnormalised normal derivative, to give 
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d^x. (1.2) 



They varied this action with respect to the lapse and found an algebraic expression for it, 



„ = y/^^^a^. ,1.3) 

This, of cours e, is clearly consistent with the ADM Hamiltonian constraint. In turn, this expression for N is substituted 



back into Eq.(Ll) to obtain the BSW Lagrangian 

/ = y ^y/R^k':ihj -trk^. (1.4) 

There is a large class of four-dimensionally generally covariant theories of matter fields coupled to gravity in which 
the Lagrangian is a sum of potential-type terms (like R) + quadratic products of the 'proper' time derivatives, all 
multiplied by N. In all of them, one can repeat the process above, solve for the lapse algebraically and pass trivially 
to the BSW form. 

The BSW action has attracted relatively little attention. It was initially proposed, especially by Wheeler, as the 
starting point of a method of solving the initial-value problem of GR. The original BSW paper Q| formulates the 'thin- 



sandwich problem', which will be mentioned later. However, in this paper, we propose to take (1.4) as paradigmatic 
for the dynamics of the universe. We shall recall some basic concepts and identify the two key properties of BSW-type 
actions. Then, in view of its relative unfamiliarity, we shall give a fairly extended outline of our approach, which we 
call the 3-space approach. 

Wheeler Q called the set of Riemannian 3-geometries on a given topology superspace, which is the ADM configura- 
tion space and the space we shall use here. However, we actually believe that conformal superspace, which is obtained 
by quotienting superspace by three-dimensional conformal transformations, may well be the true configuration space 
for gravity Q, but in this paper we work in superspace. The fundamental object we consider is a 'curve in super- 
space', a sequence of 3-geometries, labelled by some parameter A. We use A rather than t because in our approach 
time emerges from geodesic-type curves on configuration spaces. This timeless approach is motivated below and in 

®- 

For well-known technical reasons, we cannot work directly with superspace; to integrate and perform other key 
mathematical operations, we must introduce coordinates. Therefore the actual objects we consider are curves of 
3- metrics gij(x*,A), for which we define an action A. It must not depend on the choice of coordinates, i.e., A must 
be truly a function on curves in superspace rather than just on curves in Riem, the space of 3-metrics. This leads us 
to the first key property of the BSW-type actions. 

We know that A will depend on gij, its spatial derivatives, and also on its 'velocity' dgij/dX. Moreover, A must be 
coordinate invariant. In the case of A-independent transformations, all we need ensure is that the integrand of A is 
a 3-scalar, i.e., that we use covariant, rather than ordinary, derivatives and that the indices upstairs and downstairs 
match. However, crucially, we must also consider A-dependent transformations, i.e, ones that differ from slice to slice. 
They have the form a;* a;" {x^ , A) with inverse x^ (x'* , A) . We now find 

dg',^ dx- dx' (dgab , , 

t,a;b — t.b 



dX dx'' dx'i \ dX 

where 5° = —dx°'/dX, so the A derivative does not transform as a 3-tensor. This is a general problem that arises 
whenever the 'points' of a configuration space have an internal (gauge) symmetry. Indeed, it will be argued in 0| 
that this is the defining characteristic of any gauge theory. In the present case, the 'gauge' corresponds to coordinate 
transformations on space. We overcome this problem by having the metric velocity always appear as part of the 
combination kab — dgab/dX — ^a;6 — £,b-a, where we regard as an arbitrary 3-vector field and minimize the action 
with respect to it as well as with respect to the metric. This process of adjusting for the gauge freedom is universal 
and is appropriately called best matching. It was introduced to implement the conviction of Leibniz, Mach and many 
others that motion is relative We will write £,a\b + £,b:a = {K^)ab, because this expression is the Killing form of 
the vector ^* . It is also the Lie derivative of gij along . A further point is that we are constructing geodesic-type 
actions, which requires A to be reparametrisation invariant (see below). This will be guaranteed because is actually 
a velocity, so that the combination kij = dgab/dX — S,a:b ~ S.b;a is linear in 1/dX. 

Initially, we consider actions which depend only on the metric and its velocity (and the vector ^). We will later 
extend the configuration space to include a scalar function 0(a;*,A) and a 3-vector field A^{x'^,X). The velocities of 
these fields will also include best-matching corrections. Since they are based on the Lie derivative, all such correction 
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terms have a universal nature. However, their universal common origin manifests itself in remarkably different ways 
because each different bosonic field has its own different Lie derivative. Eqns. (1.1)-(1.4) together with the expression 
for the extrinsic curvature show how best matching is implemented in GR through the Lie derivative. We shall show 
how this fact coupled with the second key property of BSW-type actions leads to the novel derivation of GR. 

We now turn to this second key property of BSW actions, the square roots, which make them similar to the action 
of Jacobi's principle of classical mechanics. In fact, as Lanczos notes in his beautiful book on the variational principles 
of mechanics , Jacobi-type square-root actions were effectively used by Euler and Lagrange in their application of 
Maupertuis's principle. However, it was Jacobi who clarified the mathematics and cleanly separated two different 
principles. The first is Hamilton's principle based on the familiar T — V Lagrangian. It gives dynamical curves in 
configuration space together with the speed at which they are traversed with respect to an independent external time 
variable. The second is the principle now known as Jacobi's principle, which merely gives the dynamical orbit in 
configuration space independently of the speed at which it is traversed. It is, in fact, a geodesic principle based on the 
square root of a form quadratic in the particle displacements. In the Kepler problem, it yields the elliptical planetary 
orbits. We shall be interested in their generalization to general dynamical systems. 

It is important to distinguish between dynamical orbits determined by a Jacobi-type principle and group orbits, 
generated in the configuration space by the action of a gauge group. For us, the most important example of a group 
orbit arises from coordinate transformations on our 3-spaces. These correspond to 3-diffeomorphisms. All 3-metrics in 
Riem that can be carried into each other by 3-diffeomorphisms lie on one orbit of the three-dimensional diffeomorphism 
group. Since the notion of group orbits is well known, it is enough to mention it. 

However, it is necessary to say why Jacobi-type actions are needed in the attempt to treat the universe as an 
isolated dynamical system, as we do here. Newton took time as an independent existent. But it is read from clocks 
and ultimately cannot be anything but a clock reading. Clocks, in their turn, are simply mechanisms that participate 
in the universe's dynamics. The theory of time and clocks must be derived from the dynamics of the universe The 
fact is that speed is not obtained by dividing an infinitesimal displacement by an infinitesimal time but by another 
displacement. This was manifest, even if easily forgotten, when the rotation angle of the earth was the measure of time. 
A moment's reflection should persuade the reader that true dynamics establishes correlations between displacements, 
not between displacements and time. Now a curve in a configuration space is the complete expression of evolving 
correlations between displacements. Therefore, universal dynamics is the study of such curves. Moreover, the simplest 
such curves, i.e., the ones determined by the fewest data, are geodesies. This is why we study geodesies and Jacobi 
actions. For more details, see 

In Sec. 2, we present Jacobi's principle for point particles. The Jacobi action is the product of the square root of 
a potential term multiplied by the square root of a kinetic term. We shall discuss two different Jacobi-type actions. 
One is the standard, 'good', one; the other is a nonstandard, 'bad', one. In the first, the quadratic kinetic terms 
of the particles are summed, and then we take the square root of the sum. In the second, we take the root before 
we sum. We have found, to our surprise, that the 'bad' choice, which is realized in the BSW action ( |1.4| ), is a good 
idea. It leads to a massively over-constrained action but, almost by magic, selects GR, in the BSW form, as the only 
nontrivial case that works. We should like to mention here that, so far as we know, the first person to realise that 
BSW both was a Jacobi-type action and that it had a local square root and thus was not of the standard form was 
Karel Kuchaf. He pointed this out to one of us (JB) in 1980 (see e.g. |^). Karel's observation, which was followed 
over the intervening years by innumerable discussions with JB, was decisive for the eventual appearance of this paper. 

Thus, we take a reparametrisation-invariant and coordinate-invariant Jacobi action of the form 

A J (^^J ^/g^^d^x. (1.5) 

Formally, this is defined on the Cartesian product of Riem (the space of three-metrics) and the space of the fields ^ 
considered earlier. However, the intention is to turn it into an action defined on superspace by varying w.r.t. ^. The 
parameter A labels the 'points' on trial curves in Riem, g is the determinant of gij, the 'potential' P is a 3-scalar 
constructed from gij and its spatial derivatives, and T is a kinetic term. 

We assume that T is quadratic in the 'corrected metric velocities' kij. Then either the 'good' or the 'bad' square root 
makes the Lagrangian linear in 1/dX and reparametrisation invariant because of the dX outside. Dirac 1^ noted that 
any action that is homogeneous of degree one in the velocities will have canonical momenta that are homogeneous of 
degree zero. They must then necessarily satisfy a constraint that is primary, i.e., follows without any variation already 
from the mere form of the momenta. Now Noether's theorem (Part 2) tells us that reparametrisation invariance, which 
for general actions of the form (|l.5|) just allows a uniform scaling of the velocities over the entire manifold, will yield 



one constraint. In contrast, the 'bad' actions of the form (1.5) generate an independent constraint at each space point 



This mismatch between the constraints and the symmetry will be crucial. 



We now seek stationary points of the action A (1.5) as a functional of gij and the gauge field ^\ We first compute 



p^^ , the momentum conjugate to gij. This is the variation of A w.r.t. dgij/dX which, in turn, is identical to the 
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variation w.r.t. kij. The variation of A w.r.t. ^ is the variation w.r.t. kij (which is just p^^), times the variation of fey 
w.r.t. which is just the gradient of the variation of We integrate once and get the standard ADM diffeomorphism 
constraint, p^-' = 0. This result has nothing to do with the square root or any other structure of the action except the 
use of 'best matching' to implement diffeomorphism invariance. As we add extra fields, we will get extra velocities, but 
each will have the appropriate Lie-derivative correction with respect to the same vector ^. The striking universality 
of the results obtained later arise because all fields in nature are subject to the same diffeomorphisms generated by 
the one single field ^. 

Variation of the terms that involve the extra non-metric fields will generate source terms in the diffeomorphism 
constraint, but the universality of the Lie-derivative prescription ensures that their form will be independent of the 
Lagrangian's detailed structure. As a result, the diffeomorphism constraint seems to be kinematic in origin, reflecting 
merely the 3-tensorial nature of the considered fields. This is wrong. The diffeomorphism constraint reflects one of 
the two key properties of the dynamics, namely, that its purpose is to make the action depend solely on the group 
orbits that constitute the points of superspace. 

Because the coordinates are more or less freely specifiable at each point of the 3-manifold, the resulting symmetry 
(an arbitrary 3- vector at each point) matches the constraint (the vanishing of a 3-tensor divergence at each point). 
Therefore, we have no reason to expect that the evolution will fail to propagate this constraint. 

In the terminology of Bergmann adopted by Dirac the diffeomorphism constraint is a secondary constraint 
because it arises from variation (the associated primary constraint is the identical vanishing of the momentum con- 
jugate to ^ because A does not depend on d^/dX). However, as noted above, there is already a nontrivial primary 
constraint in the system. It follows from the mere definition of p^^ that a quadratic expression of the same basic 
form as the Hamiltonian constraint of GR vanishes identically at each point of the 3-manifold. However, if we put 
the square root needed to construct a geodesic-type action outside the integral sign, we get an integral rather than a 
point constraint. Then the single resulting constraint per manifold matches the one degree of freedom per manifold 
due to the reparametrisation invarian ce. There is no longer an independent constraint at each point of the manifold. 
This is why we say the 'bad' action ( |l.5| ) is over-constrained, and why it is no surprise that, in general, it does not 
generate sensible dynamics. We call the Hamiltonian-type constraint the square-root constraint because it arises from 
the placing of the square root. 



Our initial motivation for studying the actions ( |l.5| ) was because the BSW action for GR has this form. In Sees. 
IV and V we give a fairly complete account of the BSW action, especially the propagation of its constraints. We then 
make the Legendre transformation from velocities to momenta and rediscover the ADM Hamiltonian with its well- 
known complete freedom to 'push forward the evolution of spacetime' by different amounts across the instantaneous 
spacelike 3-manifold. This remarkable property of GR, which goes by various names including foliation invariance 
and 'many-fingered time', is thus the 'hidden symmetry' of the BSW action. It shows that an over-constrained system 
can nevertheless be viable. 



Section VI contains our first new result. We consider a general action of the form (1.5) on superspace. Specifically, 
we assume: 1) T is the most general quadratic kinetic form allowed by a natural locality requirement; and 2) the 
potential term P is an arbitrary scalar function of and its spatial derivatives. We check the propagation of the 
constraints by the equations of motion and find that in general the square-root constraint is not propagated. We 
need a 'hidden symmetry' in the Euler-Lagrange equations to match the extra constraints on the initia l da ta. From 
this over-constrained perspective, it is remarkable that nevertheless there exists a handful of actions (^]^) that do 
propagate the constraints. In them, T must have the standard DeWitt form (a result already obtained some years 
ago by Giulini ||l^) and P = DR + A, where D and A are constants and R is the scalar 3-curvature. By an overall 
scaling, we can set D = —1,0, or -1-1. The cases D = -1-1 and D — —1 give Lorentzian and Euclidean GR, respectively, 
both with A as a possible cosmological constant. The D — case is called strong coupling or strong gravity and is the 
limit of standard GR for an infinite gravitational constant |l^] . One can further argue that the D = 1 case is the only 
one for which the equations of motion can be written in hyperbolic form and thus is the only viable candidate for a 
dynamical system. All other actions force the 3-metric to be fiat and its velocity to vanish. Unlike the square-root 
constraint, the diffeomorphism constraint is always propagated, as expected. 

In Sec. VII, we see if the kinetic and potential terms for a scalar field (p can be added to BSW gravity in the 
Lorentzian case, which has an emergent 'light cone'. Best matching uniquely determines the kinetic term, but there 
is wide freedom in the potential, for which we make the ansatz 

R^ R-^g'"'cl),acl>.b + J2Ar.r- (1-6) 

n 

This is the standard (minimal-coupling) potential energy (with an arbitrary constant C) and a polynomial with 
constants An. The polynomial, which could in fact be replaced by an arbitrary function of (j) (but not its derivatives), 
gives no difficulty, but we must have C = 1 for the constraint algebra to close. Therefore, the canonical speed of <j) 
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must match with the speed of the metric disturbances: the scalar field must respect the gravitational light cone. This 
result surprised us even more than the recovery of GR from pure 3-space dynamics, especially since the manner of its 
derivation showed clearly that we had found a possible universal method of generating the SR light cone. (Until we 
extend our approach to include fermions, we cannot claim that 3-diffeomorphism invariance coupled with the local 
square-root is the origin of the universal light cone.) 

The result we have just described shows that minimal coupling (and with it Einstcinian free fall) arises naturally 
in the BSW approach. In Sec. VII, we also discuss a general range of 'dilaton' theories and show that, in many cases, 
they can be written in the local square root form. Brans-Dicke theory is a special case of such theories. We express 
its 'vacuum' form as a BSW-type action. We further show that we can self-consistently couple in a massive scalar 
field in the Brans-Dicke frame. Therefore, it seems that Brans-Dicke theory is consistent with our approach 

Life becomes even more interesting when we try to couple in a 3- vector field, which we denote by Aa (Sec. VIII). 
The kinetic term is again uniquely fixed. We consider a fairly general potential energy expression for Aa'. 



Ua = C^Aa.bA'^'' + C^Aa.bA''-''^ + C^A%A% + ^ Bfc(AMJ^ (1.7) 

k 



where Ci and Bk are constants. We include, thus, all possible terms quadratic in the first derivative plus a general 
undifferentiated polynomial term. We find that, at this level, the polynomial terms do not hinder propagation of 
the square-root constraint but that we must have Ci — —1/4, C2 = +1/4, C3 = 0. Therefore, the potential-energy 
term becomes -(curM)^/4. This means that here too we recover the common light cone for the vector field from our 
putative universal method. 

But now there is an even more striking result. As we shall see, the structure of the 3- vector field is significantly 
more complicated than in the scalar case. It generates not only the above three conditions on the three constants C^, 
but also an extra secondary constraint requiring the divergence of the vector momentum to vanish, essentially the 
Gauss constraint. This too must propagate, and we can easily show it does so only when all the polynomial terms 
vanish. This means we have derived standard Maxwell electrodynamics. We find this result truly striking. It suggests 
that gravity, the light cone, and massless electrodynamics all arise from the local square root and the action of the 
three-dimensional Lie derivative. 



In a companion paper |12| we show that the 3-metric field, the vector field and scalar fields can interact among 
themselves only in the form of complex scalar fields with U(l) gauge coupling to the vector field and both scalar fields 
minimally coupled to gravity. Thus we obtain classical gauge theory. A paper just completed by Edward Anderson 
and one of us [1^] shows that in the BSW framework 3- vector fields can interact among themselves only as Yang-Mills 
fields minimally coupled to gravity. 

The 3-space approach uses nothing but manifold geometry, the mathematics that arose when Euclid's fifth axiom 
was seen to be independent and could be omitted. It uses topology and bosonic fields: scalar, vector, and 3-metric. 
As geometrical objects, these are on a par. But their individual properties lead to a hierarchy. The 3-metric is first 
among equals. Through its covariant derivative and determinant it creates tensor calculus and the densities that 
permit integration and variation. It creates the highroad of dynamics p^ . Moreover, the Lie derivatives impose a 
chain that arises from the intricate core of the metric Killing form: gravity, light cone, gauge theory. The effectiveness 
of geometry in establishing this structure and logic in dynamics is impressive. Galileo said: "He who attempts natural 
philosophy without geometry is lost." He meant 3-geometry. Was spacetime an accident? 



II. SQUARE ROOT ACTIONS 



We first formulate Newtonian particle mechanics without time, using Jacobi's principle. It describes the orbits of 
conservative dynamical systems in configuration space without reference to motion. Jacobi's principle is very well 
known to A'^-body specialists, who geometrize motion in configuration space by exploiting the Riemannian metric 
defined there by the kinetic energy of conservative mechanical systems However, it is virtually unknown among 
relativists, who use spacetime to geometrize motion. We have here a 'cultural divide' that we hope the present paper, 
which suggests that the configuration space is more fundamental than spacetime, will help to overcome. 

For N particles, labelled by (m) = 1, . . . , A^, of masses m(m) moving in a potential ^(a^jij , . . . , the Jacobi 

action is 

/q = 2 / VE~vVTdX, (2.1) 
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where the constant E is the total energy, V is the potential energy, A labels the points on trial curves and 



^ "^(m) '^^(m) dX(m)i 



E 

(m) = l 



dX dX 



is the kinetic energy but with A in place of Newtonian time. The subscript G on Iq distinguishes the 'good' action 
from the 'bad' one introduced later. The action (2.1) is timeless since the label A could be omitted and the mere 
displacements c?X(,„)j employed, as is reflected in the invariance of Iq under the reparametrization 



A^/(A). 



(2.2) 



Jacobi's principle describes all Newtonian motions of one E as geodesies on configuration space. Its square roots 
are characteristic, indeed essential, and are central in the 3-space approach. They fix the structure of the action's 
canonical momenta, which are 



P{m) 



sc 



E -V dx 



(m) 



T 



dX 



(2.3) 



Because T (quadratic in the velocities) occurs under the square root in the denominator while the velocity occurs 
linearly in the numerator, the canonical momenta are homogeneous of degree zero. They resemble direction cosines, 
which, if squared and added, give 1. We have 

P\m)P{m)i _ E-V dx\^~^ dX(rn)i 



2m 



(m) 



T 



dX dX 



If we sum the kinetic energies, they give T, cancelling the numerator's T. This gives the reparametrisation, or 
square-root, identity 



N 

E 

(m) = l 



P\„^)P{m)^ _E-V 



2m 



(m) 



T 



xT = E~V. 



(2.4) 



Equation (2^) seems to express energy conservation, but it is ac tual ly an identity true on all curves in the configuration 
space and not merely on-shell. In the Hamiltonian formalism, (2.4) becomes a quadratic constraint. 
The Euler-Lagrange equations are 



dpl 



SC 



dX 



Sx 



{m)i 



T 



E-V 



V, 



(2.5) 



where A is still arbitrary. The solutions of Eq.(2.5) are, as they must be, parametrised curves in configuration 
space. Within generalized Hamiltonian dynamics Q , there is no guarantee that the Euler-Lagrange equations used 
t o p ropagate the canonical momenta will do so in such a way that the identity ( ^.4[ ) will be maintained. This is why 
(2^), and any other identity like it, is to be regarded as a constraint whose propagation must always be checked. In 
the present case, the Euler-Lagrange equations do conserve the constraint: 



N 

E 



P{m)i dp" 

dA 



(m) = l "^(™) (m)^! "^'"-! ' ' (m) = l 

But this is not the end since A is still free. If we choose it such that 

T 

l^T = E -V 



N 

E 



P{r. 



T 



E-V 



V = - 



N 

E 



E -V dx 



(m) 



T 



dX 



T J y_ dV 



E-V 



(2.6) 



(2.7) 



then Eqs.(2_^) and (2^) become 



dx 



P{n 



(m) 



dp, 



(m) 



dX 



dX 



~d\m)V, 



(2., 



6 



and we recover New ton' s second law with respect to this special A, which has the same properties as Newton's absolute 
time. However, Eq.(2/7), which is usually taken to express energy conservation, becomes, in the absence of an external 
time, the definition of 'Newtonian' time. Indeed, this emergent time is the astronomers' operational ephemeris time 



We now consider the 'bad' (suffix B) Jacobi action with square root and summation swapped: 
where E, V and A have their previous meanings, and 



t(m) 



dA, 



(2.9) 



(m) = l 



T(m) = 



'•"(m) '^^l-m) dX(,n)i 



is the nrfi^ particle's kinetic energy. 

We will generalise this slightly by bringing the 'i? — V inside the summation sign. Then each particle can have its 
own energy and potential, and we have 



1^ = 2 



N 

E 

(m) = l 



(m) 



V, 



(m) 



T(m)dX, 



(2.10) 



In Jacobi's action, the 'distance' in configuration space is an iV-dimensional Pythagorean sum, i.e., the square root 
of a sum of squares (with mass-weighted 'legs'). In the new action, we add the c?s's along a line. In the 'good' ds, the 
change due to a change in one leg depends also on what all the other particles are doing. This linkage is not present 
in the 'bad' ds. 

The new canonical momenta are 



P{m) 



5C 



IE, 



'(m) \ 



(m) 



T, 



(rn) 



dX 



(2.11) 



Now to the point. Instead of just one in the 'good' case, there are now N independent identities, one for each 
particle. They arise just due to the placing of the square root. For this one core mathematical relationship, we shall 
use various names: reparametrization or square-root identity; Hamiltonian, quadratic or square-root constraint. The 
new identities are 



2m 



The Euler-Lagrange equations 



(m) 



d\ 



T, 



X T(rn) — Elm.) — V( 



(rn) 



(m) 



sc 



N 



Sx 



(2.12) 



(2.13) 



do not now in general preserve the constraints (2.12). There are extra secondary constraints, one for each particle: 



_d_ 



2m, r„-, 



E, 



(m) 



(m) 



+ Z^(n) = l"(«) *^(™)m(„) 



(2.14) 



Here, we inverted the expression ( 2.11 ) for the momenta to find dxfm^i/dA and substituted to obtain the last line. 
This does not vanish identically and thus is a new secondary constraint. We cannot have such proliferation, so are 
there any cases for which the expressions ( 2.14 ) vanish? 

The easiest way to understand their implications is to consider special initial data. Suppose particle (1) is moving 
and all the others are instantaneously at rest. Take Eq.(2.14) for (to) — (2). If it vanishes 
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C?(1)V(2) 




Til) 



(2.15) 



0. Taking different (m) and (n), we find c'(„-)V(„) = OVn 7^ m, i.e., can only depend 



This requires c?|j^^V(2) 

on a:^^^ and not on the other particles' coordinates. Then from Eq.(2.14) we find that all the secondary constraints 
are identically satisfied, with no restriction on V(„) . The particles can each move in their own external potential but 
cannot interact among themselves. 

Physically, the system starts off with one global reparametrisation invariance. However, each particle wants to be 
independently reparametrisable. To see this, we seek an equation to propagate the kinetic energy T(^m) of ^^ny one 
part icle. None exists. The direction of each particle is fixed, but its speed is arbitrary. To see this another way, note 
[Eq.(2.11)] that the momenta are homogeneous of degree zero in the velocities. The y de pend only on the direction of 
motion of each particle and not at all on its speed. The Euler-Lagrange equations ( 2.13| ) govern the momenta, which 
are directions, and thus say nothing about speeds. This is why the system makes sense only when the particles do 
not interact. We may also note that the local square- root action is the sum of N independent single-particle Jacobi 
actions. The particles cannot interact; but if they do not interact each can be independently rcparametrised. 

The 'bad' Hamiltonian is illuminating. It is just the sum of the constraints, each with a Lagrange multiplier: 



H 



B 



N 

E 

(m) = ] 



N, 



(m) 



P{m)P{m)i 



2m 



E, 



(m) 



(m) 



(m) 



(2.16) 



Variation w.r.t. the multipliers N^^^) gives the constraints. Hamilton's equations are 



dp) 



(m) 



dt 



dx 



(m) 



dt 



N, 



(m) 



2m 



(m) 



(2.17) 



These are exactly the equations of motion (2.11) and (2.13) from /g with the identification N^ 



Hrn) 



V(m)- Once again the kinetic energy is seen to be undetermined. 



r. 

The 'bad' action (2.10) evidently has interesting and important properties: The system is over-constrained and the 
constraints do not, in general, propagate; the 'defect' acts as a filter, picking out the 'simplest' action; the special 
action has a 'hidden' symmetry (not explicitly exhibited in the action); the 'b ad' Hamiltonian is just a sum of the 



constraints. Alternatively, one could start with a Hamiltonian such as Eq.(2.16) which is a sum of constraints, each of 
which is quadratic in the (undifferentiated) momenta. Now all one has to do is repeat the BSW reduction procedure. 
First one constructs a Lagrangian by means of a Legan dre t ransformation. This will contain the Lagrange multipliers. 
These are eliminate d and one ends up back with Eq.( 2.1C ). All these properties carry over to the extension of the 
'bad' Jacobi action ( ^.10 ) to a field theory, in particular to gravity. 



III. REPARAMETRISATION-INVARIANT ACTIONS AND GRAVITY 

As we already said at the start, 40 years ago Dirac Q and Arnowitt, Deser and Misner (ADM) showed that GR 
could be treated as a dynamical system on the configuration space of Riemannian 3-geometrics with fixed topology 
(called superspace by Wheeler [Q). We work in this paper with compact manifolds without boundary; the extension 
to more general topologies (e.g., asymptotically flat) is straightforward. 

We now begin the programme outlined in the introduction. We shall consider a class of Jacobi-type actions on 
superspace. We will show that the requirement that the equations of motion be strongly local (in a sense to be defined) 
and self consistent drastically restricts the actions. In fact, standard GR with locally Lorentz-covariant matter fields 
that interact through the gauge principle emerges naturally. We begin with matter-free superspace and obtain pure 
geometrodynamics. 

For definiteness, we take the compact manifold to be S^. We will actually work in Riem, the space of suitably 
smooth Riemannian metrics gij{x),x G S'^. The equivalence classes {gij} of the metrics gij under suitably smooth 3- 
diffeomorphisms on Riem are the points in superspace. Consider a smooth family of smooth diffeomorphisms x'* {x^ , 6) , 
where the label 9 distinguishes the members of the family and the diffeomorphism goes to the identity map as — s- 0. 
The general infinitesimal 3-diffeomorphism of gij (x) is generated by the Killing form by 

|e=0 - l^ia^, = {m^J = V,;^, + V,?., (3.1) 



8 



where ^'(x) = dx''^ /dO evaluated at = 0, and C indicates the Lie derivative. The KiUing form, with its multi-index 
covariant derivatives Vj, is the most intricate Lie derivative we shall encounter; propagation of its intricacy creates 
gravity and imposes deep universal structure on matter coupled to gravity. The Killing form drives it all through the 
square-root filter. 



We seek a measure on superspace like the 'bad' Jacobi action (2.10). Consider two metrics on, say, , which we 



call (j*^^^ and i?'-^-'. We take coordinates with the same range, so that we can identify points on the two manifolds with 
the same coordinate values. We can compute, at each point, the difference between the metrics as dgab = 9^ab ~ 9ab' 
where we take dg to be small but finite. We want a difference linearly proportional to dg, something like [dgY- 
There are two natural squares of a symmetric two-index tensor. One is the sum of the squares of the individual 
elements, while the other is the square of the trace. We include both with an as yet undetermined coefficient, and, 
maintaining maximum generality with strong locality, we consider the object 

T = G^^^^^^ldgabWdgcd], (3.2) 



Qabcd ^ g. 



b „cd 



(3.3) 



where A is the undetermined constant coefhcient. Clearly, ^Jt is a natural measure of the difference of g*-^^ and 5'^' 
at one coordinate point. To find the total difference between g*^^) and g^^', we need to integrate over S^. To obtain a 
more general measure, we can modulate ^/t by some spatial function of the metric, which we call ^fP, to give 



Let us assume that all the unlabelled g's are i^^^^'s and integrate over the first manifold. The second ma nifol d is an 



V~9 



PVrd^a 



(3.4) 



equally good base that gives an answer whose slight difference is i mmat erial in the limit. It is clear that (3^) is the 
natural generalization to a field theory of the 'bad' Jacobi action (2.1C). We have merely replaced summation over 
particles by integration over a field. 

This is a distance on Riem. To get a distance on superspace, we make different coordinate transformations on each 
of the two manifolds, getting g'^^-' and 5''^'. We recalculate A(f;'^^-' , g'^^-*) and, in the key step, find the minimum of A 
over all possible transformation pairs. The minimum, provided it exists, will be a function on superspace. In fact, we 
only need to make transformations on the second manifold, because identical transformations on the two manifolds 
leaves A unchanged. 

To transform this finite-distance function A into an infinitesmal, consider a curve of metrics with label A that 
interpolates g'^^^ and g^^\ Evaluate dA/d\ at any A and get 



dA 



= J 



Td^3 



with 



T = G' 



abed 



dgab 
dX 



dg. 



cd 



dX 



(3.5) 



with G°'"='* given by (U). 

To take into account the A-dependent coordinate transformations, we add a Killing form to dgab/dX, dgab/dX —f 
dgab/ dX — {K(,)aby where ^ is a priori an arbitrary 3-vector field. The 'distance' along the curve between g^^^ and g^^' 



extremum with respect to ^ of / o?A 



VpVt. 



(3.6) 



The extremalization in (3.6) w.r.t. ^, which we call best matching j^, makes A into a measure on superspace 
and not just on Riem. We use the word measure, as opposed to metric, because we are actually about to create a 
degenerate (albeit highly interesting) structure. The ideal, motivated in |^ by an idea that goes back to Poincare, 
would be to have a true metric on superspace with geodesies as dynamical curves. However, because we choose the 
'bad' action, we will find that although a best-matching action is in principle defined on all curves in superspace there 
are many curves joining two given points in superspace that have the same action. This degeneracy would not have 
occurred had we opted for the 'good' action. The consequence of our 'bad' choice coupled with the square-root filter 
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is to acquire an extra symmetry, which, unhke the 3-diffeomorphism symmetry on the configuration space, acts on 
the phase space. 

It is worth relating this to the number of physical degrees of freedom of the gravitational field, which is universally 
agreed to be two if one rules out tensor-scalar gravity of Brans-Dicke type. Now, since the 3x3 symmetric tensor 
gij has six independent components of which three correspond to coordinate freedom, a true geodesic theory on 
superspace must have three physical degrees of freedom per space point. Our 'bad' choice introduces an extra scalar 
symmetry and takes us down to two physical degrees of freedom. Our readers might begin to suspect that we are 
merely taking them on a roundabout route to standard GR. 

To counter this worry, we first note that even if it is indirect the way is new. We get to the goal with much reduced 
kinematics - there is no time and no presupposed Minkowskian spacetime structure in the small. More significantly, 
we get new insights into the origin of the universal light cone and gauge theory. We should also like to draw attention 
to our earlier paper in which we apply best matching and a local square root on conformal superspace (CS). 
Since the 3-vector symmetry of the 3-diffeomorphisms is augmented on CS by the scalar symmetry of 3-conformal 
transformations, the six degrees of freedom on Riem are directly reduced to 6-3-1 = 2 on CS. Remarkably, the use 
of the local square root in this case still acts as a filter of theories but does not reduce the physical degrees of freedom 
any further. There is no extra phase-space symmetry associated with the theory on CS. Since the coupling to matter 
on CS has still to be worked out in detail, we shall not explore this route further in the present paper. But it does 
hint at interesting possibilities (see also 

Returning to the task in hand, the variational principle on superspace, the final step is to seek the curve between 
(7^^'' and g*-^^ that minimizes the best-matching 'distance' between the two geometries . This is achieved by varying 
w.r.t. the metric gij. 

Note that T as written is the most general ultralocal (i.e., with the supermetric G'^^'^'^ dependent only on and 
not on its spatial derivatives) quadratic form in dgij up to an overall constant (which cannot affect the resulting 
geodesic curves). As we noted, the first term is the sum of squares and the second the square of the trace. The 
relative contributions of these two scalar terms has yet to be determined, hence the coefficient A. For A = +1, G'^'"^'^ 
is the DeWitt supermetric. 

The above analysis shows that in the action is the rate of change of a three-dimensional coordinate transformation, 
i.e., it is a 'velocity'. Moreover, the action does not depend on the associated 'position' (three-dimensional coordinate 
transformation). Thus, it appears that ^ should be treated as a 'cyclic' or 'ignorable' coordinate rather than as a 
Lagrange multiplier (a 'position' without a 'velocity'). In fact, it will be shown in that ^, like all such variables used 
to implement best matching, is neither a multiplier nor a cyclic coordinate but a sui generis variable for which the 
variation at the end points is not fixed. The same analysis will show that in most cases, including the one considered 
here, the treatment as a multiplier is valid, though there are cases in which it is not. In this paper, we shall therefore 
regard ^ as a multiplier, which matches the standard treatment in gauge theory. 



We conclude this section by underlining the presence of the local square root in (3.6), i.e., the square roots are 
taken before the integration over space. This is the analogue of the 'bad' Jacobi action. The analogue of the 'good' 
action with 'global' square roots is 



/ = <Px^P^ J <Px^T. (3.7) 

Such an action yields geo mctr odynamics with three freedoms per point. Moreover, the global square roots do not 
'filter'. The one basic form (3.7) allows many instantiations. In contrast, mere consistency applied to candidates for 
P and T in (3_^) and its obvious generalization to scalar and 3-vector fields on Riem leads directly to matter- free 
gravity (Sec. V), to matter fields minimally coupled to gravity and obeying its light cone (Sees. VI and VII) and to 
the gauge principle and massless electrodynamics (Sec. VII). These are all latent in (|3.6| ). We must now make them 
appear. 

IV. THE BSW ACTION AND ITS HAMILTONIAN FORM 

The Baierlein-Sharp- Wheeler (BSW) action ||l| has the form 

^BSW = j '^^j Va^Vfd^x, (4.1) 

where the 'kinetic energy' T is 
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dgab 



dg. 



cd 



dX 



(4.2) 



the monotonic A labels the 3-metrics gij{x,X) on a curve in Riem, R is the scalar curvature of gij, and the other 
symbols have been explained. 

Although it was not found in this way, the BSW action can clearly be derived from the 'distance' (3.6) with A — +1 
and R as the 'potential' P. We first show that ^bSW ^^ads to the Dirac-ADM Hamiltonian. The Lagrangian density 
is £ = ^\fR-\fT ^ and the canonical momenta conjugate to gij are 



(5£ 



( 9gij \ 
\ 9^ J 



dg 



cd 



dX 



cd 



(4.3) 



We immediately observe their homogeneity of degree zero in the 'corrected velocities' dgab/dX — {K^)ab, which 
occur linearly in the numerator and quadratically under the square root in the denominator T. They define a ['local'] 
direction in superspace, as opposed to the direction and speed of ordinary momenta. The BSW action is therefore 
timeless, determining only paths without speed in superspace. Just as squared direction cosines sum to 1, the BSW 
canonical momenta satisfy the square-root identity 



gH 



gR = 0, 



(4.4) 



with the trace p — gabP ■ This identity is obtained by substitution of ( |4.3D into (4.4) and in generalized Hamiltonian 
dynamics is a primary constraint ||^. Another primary constraint is the ide ntic al vanishing of the momentum conjugate 
to that arises from our taking it to be a multiplier. Then variation of (|4.l| ) w.r.t as a multiplier gives 




g^'g'") 



dgcd 
dX 



(4.5) 



This, on using Eq.(4.3), gives us the secondary constraint 

= /f, = 0, 



(4.6) 



which consists of three conditions per space point. We also have the as yet undetermined vector ^, with three 
components per point. We wish to interpret (4.5) as an equation for ^. The solution of this equation is the well- 
known thin- sandwich problem [T^ ]. Suppose we can find ^ as a function of gij and dgij/dX. We can then substitute 
this back into (4.1) to get a measure on superspace that is determined solely by the 3- metric. The auxiliary multiplier 
^ will have been eliminated, and we should have a well-posed initial-value problem for gij and its A derivative. 

This is what the logic of the 3-space approach suggests. In fact, the thin-sandwich problem is burdened with difS- 
culties [T^ ], and one should see our Lagrangian as heuristic rather than practical. Fortunately, it leads unambiguously 
to a Hamiltonian form - the Dirac-ADM fully constrained Hamiltonian - that is much more tractable and in which, as 
York showed Jl6t , one can genuinely obtain a well-posed initial- value problem. Our motto is therefore: "Conceptualize 
in the configuration space, calculate in the [constrained] phase space." 

The entire evolution dynamics is in the Euler-Lagrange equations for giji 



dp'^ 
~dX 



5C 

Sgt] 



-\— (R'' 

V 4i? ^ 





(4.7) 



where C^i stands for the Lie derivative along 

The square-root identity forces the standard Hamiltonian to vanish identically, as it does for all Lagrangians 
homogeneous of degree one in the velocities. Using Dirac's generalized Hamiltonian dynamics |9j, we consider 



n = / y/g{NH + N.,W)d^x 



(4.8) 



where N and Ni are position-dependent multipliers, and H and are the constraints (4.4) and (4.6), respectively. 
Variation w.r.t. N and Ni imposes H = and — 0. 

is exactly the Dirac-ADM Hamiltonian; H is the Hamiltonian constraint; W is the momentum 



The expression ( 4 



constraint; N is the lapse; iV* is the shift. The standard equations of motion that it yields are 
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dgab ( 1 1 , , 

^TT- = 2— pab - 7:gabP + Na-.b + Nb:a 

d\ ^ \ 2 ' 



Ng'' ( ab 1 2 



Inverting (|4.3|) to obtain dgij/dX in terms of the momenta, 



V5 




and comparing ( p7| ) with ( |4l0| ) and with we get the identifications 



(4.9) 



(4.10) 



(4.11) 



(4.12) 



In fact, even after these identifications, the dynamical equations (4/7) and ([1.10| ) differ by a muhiple of the Hamil- 
tonian constraint. This has no conceptual significance because the constraints vanish, but such differences can be 
important in numerical work. Moreover, if in the ADM Hamiltonian one chooses N/ ^fg instead of N as the inde- 
pendent variable, then, as emphasized by York [ pT[ , the extra term vanishes, and one gets complete agreement. This 
shows that the BSW and ADM solution curves are closely related. 

However, they ar e no t quite identical. The AD M lapse and shift are entirely free functions, while the BSW logic calls 
for the solution of ( |4.5| ) for ^. If this succeeds, •y/T/4_R is determined. It seems that the BSW initial data determine 
a unique direction in superspace as distinct from the ADM 'many- fingered time'. This apparent discrepancy will be 
resolved in Sec. V. 



V. PROPAGATION OF THE BSW CONSTRAINTS 



For consistency, the evolution equations of constrained theories must propagate the constraints initially zero, 
they must remain so. Thus, we must have dH/d\ — and dW/dX = by virtue of (4/7) and the definition ([l.3|). 
Since constraint propagation is crucial throughout this paper, we shall exhibit it in some detail for BSW. We begin 
with t he m omentum constraint = 0, differentiating it w.r.t. A and using the BSW (not ADM) evolution equations 
(4.7)-( 4.11 ) to replace the A derivatives of gij and p^K To simplify (and make the ADM connection), we write iV in 



place of yjT /AR. After cancellations and rearrangements, we are left with 



W"'p)p'n.:,b 



N-^'^[R-g-^{p^'Pab-hP^)] 



(5.1) 



Thus, d/d\{p^-' .j) vanishes weakly if the constraint holds initially, it will propagate. For the Hamiltonian 
constraint 



d_ 



1 



gR- [p^'pab-^p- 



4N--^p\.^, + 2Np%, + C^ 



1 



gR - [p'^'Pab-^P' 



(5.2) 



so this constraint too propagates. (In actual fact, when expressed in Lagrangian, as opposed to Hamiltonian, form, 
the expressions within the square parentheses in (5.2) vanish identically. This vanishing is precisely the content of 
the square-root identities. However, both here and in the equations below, we prefer to show these terms explicitly, 
if for no other reason than that they must be present in the Hamiltonian formulation, in which, as we already noted, 
the Lagrangian identity become s a true Hamiltonian constraint. 



In the light of the definition (4.S), the evolution equations (4.7) appea r to give six equations for the six d^gij/dX^. 
The fact that (p3) propagate the four constraints tells us that only two of (|4.3|) are true evolution equations. Therefore, 



the system is highly under-determined. In particular, we expect that d{p.j)/dX = should be used to solve for d^/dX. 
Since the constraint is preserved, we end up with no restriction on dS,/dX. A similar situation is true for dN/dX. 

The upshot is that N and ^ are fixed on the initial slice by the initial data, but their evolution is free. Hence the 
freedom in the ADM lapse and shift is effectively shared by the BSW action, and both generate Einsteinian gravity. 
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In the local-square-root particle model, discussed in Section II, in the special case where the constraints are pro- 
pogated the action reduces to a sum of single particle Jacobi actions. Therefore the action is locally reparametrisation 
invariant even though it is expressed in terms of a glabal parameter. This is not true in the BSW action. The spatial 
derivatives in the scalar curvature are evaluated at a fixed 'time'. If we make a 'local' parameter change we will 
change the surfaces of constant time and thus change the derivatives in a complicated fashion. Therefore the BSW 
action remains only globally reparametrisation invariant. 



VI. UNIQUENESS OF BSW 

We have seen that constraint propagation is important. Many have sought conditions under which GR can be 
derived. Two main strategies have been followed. The older classical arguments, reviewed by Hojman, Kuchaf, 
and Teitelboim (HKT) relied on four-dimensional general covariance coupled with simplicity restrictions in a 
Lagrangian framework. These essentially select the Hilbert action uniquely (up to an arbitrary cosmological constant). 
More recently, Teitelboim [fsl started from a Hamiltonian viewpoint and deduced matter-free GR by postulating: 



1) that the Hamiltonian should have the local form (4.8); 2) that H and should depend only on the 3-metric 



and its conjugate momentum p^^ ; and 3) that the resulting dynamics should satisfy an embeddability criterion 
proposed by Wheeler: "If one did not know the Einstein-Hamilton-Jacobi equation, how might one hope to derive it 
straight off from plausible first principles, without ever going through the formulations of the Einstein field equations 
themselves? The central starting point in the proposed derivation would necessarily seem to be 'embeddability' [in a 
four-dimensional pseudo-Riemannian spacetime]." 

As Teitelboim noted in his PhD thesis this is an extremely restrictive condition. Developing an approach of 
Dirac j|] , he showed that embeddability imposes a strict requirement on the Poisson-bracket relations between H and 
H^. They must satisfy the so-called Dirac algebra. In [||, HKT then sought theories in which the manner in which 
the constraints close ensures embeddability and showed (again with certain simplicity requirements) that GR is the 
unique theory that does so. 

As our first new result, we show that embeddability is a much stronger condition than one needs. The constraint 
algebra need not close in a specific way. It is merely necessary that it close. As we shall see, this opens up an 
entirely new derivation of relativity - both the special and the general theory - in which no a priori assumption of 
geometrodynamic evolution of spacelike hypersurfaces in a four-dimensional pseudo-Riemannian spacetime is made. 
We can derive relativity without relativity merely by postulating an action based on a metric 'distance' of the form 
( |3.6| ) and requiring that its constraints propagate. 

Bes t m atching, which ensures 3-diffeomorphism invariance, automati cally leads to a momentum constraint of the 
formdU). The local square root leads to a local square-root identity like (|4.4|) , which becomes a quadratic Hamiltonian 



constraint like ([4.5|). We are led naturally to a local Hamiltonian of the form (4.8). Both constraints strongly restrict 
the possible Lagrangians (or Hamiltonians) through the condition of constraint propagation. 

We make no attempt at an exhaustive analysis and employ a pedestrian technique. We suspect our various individual 
results could be obtained more elegantly in a unified manner but think it premature to seek it at this stage, since there 
are several extensions of the method, which we shall mention at the end of the paper, that should first be explored. 
In the meanwhile, our individual results show the potential of the 3-space approach. 

We start with the simplest modification of the BSW Lagrangian: changing the coefficient A in the supermetric 
from the DeWitt value A — 1. The inverse to the supermetric is gaegbf — j^zi9abgef because 



A 

gaegbf - 3^ _ ffabge/ 



[ffV' - ^5"V1 = (6.1) 



We define 



B = (6.2) 
3A - 1 ^ ' 



because when A — 1 we also have B — 1. 
Hence we start with a modified BSW action 



^BSW " / / Vg^^d^^ (6-3) 



with 
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ab ^cd\ 



T = {g-^g"^ - Ag'^'g 



The canonical momenta conjugate to g^j are 



5L 



This can be inverted to give 



dgab 



{g^'g^" - Ag'^g-'') 



imab 



dg. 



cd 



dg. 



cd 



ax 



cd 



(6.4) 



(6.5) 




B 



Pi] 



g,jP + 6;j + ^j; 



The square-root identity becomes 



gH 



-p''pij + jp^ + fjR = 0, 



while the linear constraint that arises from varying w.r.t. ^ is unchanged: 
The evolution is again in the Euler-Lagrange equations for g^j-. 



dp^- 



SC 



T 



dX 6g. 



gR 



B 



(6.6) 



(6.7) 



(6.8) 



(6.9) 



We use equations (5.6) and (6.9) to evolve the constraints, obtaining 



d_ 
dX 



2N_ (^im _ B im \ b 
^ [P 2 9 P)P rn;b 



(6.10) 



The A derivative of the momentum constraint, being proportional to itself, vanishes weakly, and so the constraint 
propagates. However, for the Hamiltonian constraint 



d_ 

dX 



1 



B 



V9R-^[ P'^'Pab - -P 



AN-'y\.^, + 2Np-\, + {^) Np yZjR - {p-'pab - §P^) 
+ {2B - 2) NV^p + (4S - 4) N'^'p.i 



VgR-^ip'^'pab-^p'' 



(6.11) 



The right hand side of ( |6.11 ) does not vanish weakly. It is clear that the trace p must satisfy the secondary constraint 

p = constant. (6.12) 

This is the well-known constant-mean-curvature (CMC) gauge condition |jl^, and it severely restricts the initial data. 
It also forms a second-class constraint with the Hamiltonian, with which it does not commute. When we evolve 
this constraint, we get the standard CMC slicing condition: 



V^iV - RN ^C, 



(6.13) 



where C is a spatial constant, essentially half dp/dX. This is yet another restriction on the initial data, and these 
results show that we cannot connect arbitrary 3-geometries, i.e., points on superspace, by curves that extremalize the 
action based on the measure (3.6) unless A—1. 

Thus, a consistent BSW-type action with A^l does not exist on superspace. We note, however, that the secondary 
constraints just obtained, particularly the special case p = of (6.12), arise naturally if gravity is treated on conformal 
superspace (CS) ||l^, 0. We have already mentioned our belief that gravity will only be properly understood when 
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treated as a true geodesic theory on CS. We proc eeded directly in that direction in but we have since reahzed 
that the difficulty pres ented by the condition ( 6.12| ) can be overcome in two different ways: either on superspace with 
A = 1 or on CS with ( 6.12| ) imposed Characteristically, both 'escape routes' are indicated by our method. We 
also mention that our A ^ 1 result was already obtained by Giulini some years ago 

We have seen that the sole apparent freedom (the value of A) in the kinetic term T is illusory, and that best 
matching and consistency fix it uniquely. We now apply the same technique to the 'potential' P. One modification 
works. We first show that P = A + DR gives a consistent theory with A as the cosmological constant and D is another 
constant. 

Keeping the uniquely determined DeWitt T, we now modify the BSW action to 



^BSW ^ J '^^J Vg'/DR + AVTd^ 



The momenta conjugate to g.^j are 

pi] 



This can be inverted to give 



( dgij \ 
\ 9^ J 



dX 



g{DR + A) 



T 



dgcd 



cd 



T 



g{DR + A) 



The square-root identity becomes 



(6.14) 



(6.15) 



(6.16) 



gH = -p '■>pij + -p 



g{DR + A) = 0, 



and we get the standard momentum constraint 



The Euler-Lagrange equations are 
dp'^ SC 



dX Sgi 



4{DR+A) 



D 



^W^p'^^^O. 
DRiJ ~ g^iDR- g^^K) 



g{DR+A} 



4(D_R+A) 



4{DR+A) 



iPP- 



We use (6.16) and ( 6.19| ) to evolve the constraints, noting that now TV = yjT / A{DR. + A). We get 



d_ 
dX 



+ N-'^[DR + A-g-^p-''Pab 



{P':j) = [VaiDR + A) - ^ {P'^'pab - Ip") 

-'-rAp''^~\9-p)p\a:b + c^^ {{P';)- 

Thus, as expected, the momentum constraint propagates. For the Hamiltonian constraint we have 

'^{DR + A)-^(p-^Pab-kp^) 



\P')] 



dX 



ADN'-p\., + 2DNp-\^, + \Np ^{DR + A) - (p'^V. 



-TV' 



^{DR + A) - ^ {p'^^pa, 



\P') 



\P') 



(6.17) 



(6.18) 



(6.19) 



(6.20) 



(6.21) 



The right-hand side of ( |6.21 ) also vanishes weakly. 

As we mentioned in the introduction, we can by multiplying by an overall scaling factor set D = -|-1,— 1, or 0. The 
first case is standard general relativity with a cosmological constant, the second case is euclidean gravity, again with 
a cosmological constant, and the third is strong gravity. 
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We now try to modify P more radically. Keeping to our pedestrian approach, we consider two special but illumi- 
nating cases. The first is P = i?" with a a constant. Then, with the DeWitt T, we have 



The momenta conjugate to gij are 



sc 



The inversion gives 



dX 



dgcd 



dX 



1 



gijP +£.j:i, 



(6.22) 



(6.23) 



(6.24) 



so we define N = ^JT/AR". The square-root identity becomes 
and we get the standard momentum constraint 



(6.25) 



(6.26) 



The dynamical equations are 



-^ = — =- ^NR-^ {aW^ - .gy p) ~ 2N (j[f^p,l - ipp^) 



(6.27) 



We use (6.24) and (6.27) to evolve the constraints. Once again, the Hamiltonian constraint docs not propagate: 

d_ 
dX 



V-gR^-^^[p''p^'>-lp' 



AaR°'-^N-'''p\.^ + 2aR"-^Np''\^^ 



\Np 



Pab 



2aN {R''-^y"%ab 



- ^ {p'^^'Pab - \P^ 



(6.28) 



This does not vanish weakly for any a except a — \. We get the extra constraint R — constant. Conserving this gives 
yet another, unpleasant, equation. It is difficult to conceive any solution of this system except static flat space. 
Another choice we tested was 



P = CiR^ 



02-n. Uab 



(6.29) 



where Ci,C2,C3 are arbitrary constants. If gij is taken to have dimensions (length)^, then R will be (length)^^. 
No scalar with dimensions (length)""^ can be constructed from gij . The only geometric scalars that have dimension 
(length)""* are the three in expression (6.29). The other two obvious candidates, the square of the Riemann tensor 



and R .^j, need not appear. The three-dimensional Riemann tensor can be written as a sum of the Ricci tensor and 
R, and the divergence of the Ricci tensor can be eliminated using the Bianchi identity. 

We repeat the calculation, evolving the square-root constraint. This leads to an explosion of unpleasant non- 
cancelling terms that arise from the extra terms in P. 

One soon sees that the same problems will arise for all possible extra terms. We conclude that BSW is the unique 
consistent matter-free theory on superspace based on a 'distance' of the form (1.5). We believe that this is a new 
result. In many respects, our calculations repeat those of HKT |^. The novelty is our weaker assumption. The HKT 
assumptions are: 1) there is a local Hamiltonian constraint, quadratic in the momenta; 2) there is a local momentum 
constraint; 3) the Poisson bracket of these constraints reflects embeddability. Our local square root is equivalent to 
1); best matching is equivalent to 2); constraint propagation on its own replaces 3). There is no need to presuppose 
spacetime. Already latent in (1.5), it is laid bare by consistency. 
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VII. SCALAR FIELD INTERACTING WITH GRAVITY 



There exist matter- free solutions of Einstein's equations on S^. Thus, there is an emergent hght-cone structure in 
the 3-space approach. Besides the pure-gravity hght cone and 4-covariance (which we have recovered), the most basic 
relativisti c fa cts are the universahty of free fah and the universal light cone (all matter fields respecting the gravity 
cone). If ( |3.6| ) is the basis of relativity, both of these further features should be implied by it. 

Let us see how a real scalar field (j) can be introduced. First, best matching essentially fixes the form in which 
4> enters the kinetic term T. For (f> is 'painted' onto the 3-geometries described by the 3-metrics gij , so that the 
correction to its 'naive' velocity d(j)/dX, like the correction to the metric velocity dgtj/dX induced by (3J_), is 
predetermined. It is the scalar product of ^ with the spatial gradient of the matter is 'dragged along' with the 
geometry by the diffeomorphisms. Technically, the correction term is just the Lie derivative of cj) along ^ just as 
if^ is the Lie derivative of gij. The modified T is 



{9 



aCgbd 



dgab 
dX 



m)ai 



dgcd 



dX 



m) 



cd 



dX 



(7.1) 



As here, the coefficient of the scalar kinetic term can always be set to 1 by absorbing a constant into 
modifications to the potential P are 



J. The obvious 



R 



C 



(7.2) 



The first addition is the standard scalar-field term that gives rise to wave propagation. It has the same dimensions, 
(length)"^, as R. If the constant C ^ -1-1, then (j> will not have the same light cone as gravity and local Lorentz 
invariance will be violated. The second addition is a general polynomial non-derivative self-interaction term for (p. 
For n = 2 and A2 = we get the standard mass term for cj). We need not demand that n be an integer. We have 

dropped A, but it can easily be restored. We include neither the higher-order metric terms excluded in Sec. VI nor 
higher-order metric-scalar interactions. We expect that these too can be eliminated. 

The metric momenta are 



sc 



















where Ucj, \s the (j) potential term. Inversion of (7^) gives 



2N ( 



dg. 



cd 



dX 



cd 



(7.3) 



dX Vff V 

where we define 2N = \jTg + T^/R + U^. The momentum conjugate to (j) is 



(7.4) 



5L 



m 



Its inversion gives 



.g(i? + 








2Ntt 




1- ' ^. 



(7.5) 



(7.6) 



The square-root identity becomes 



gH = -/V^, + - + g[R + U^) = 0, 



(7.7) 



and variation w.r.t. ^ gives the momentum constraint 



(7.1 
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The Euler-Lagrange equations for g.^ and <j) are 



View™ 



dn_6£ 
d\ ~ 56 



We can now combine (|7.6[) and (7.10) to give 



dX\ N 



dX 



(7.9) 
(7.10) 

(7.11) 



If we set ^ = TV = 1, n = 2, = "iV4, ^ = 0, this reduces to d'^(f)/d'^X - CV^ 
for a scalar field with mass m and canonical speed VC. 



= 0, i.e., the wave equation 



The new Ha milt onian and momentum constraints (7/7) and (7.8) are the 00 a nd O i Einstein field equations. Note 
that TT (/)'■' from ( |7.8| ) completes the square of 7r^ + l/4(V^p from (7.7). The 1/2 in (7.8) arises because the Hamiltonian 
constraint has 167rp, while the momentum constraint has 87rJ\ These are obtained because of our best- matching 
'dragging' of (j>, which leads to minimal coupling and the equivalence principle. Note that the divergence p'^'.j of the 
gravitational momenta no longer vanishes but equals the (j) 'current' ^^tt'^ (the (j) momentum density), while the 
Hamiltonian constraint picks up (twice) the energy density. 

We now come to the next constraint-propagation result. The momentum constraint propagates, as always, and the 
polynomial self-interaction terms (including the mass term) in the quadratic constraint cause no difficulty; the extra 
terms in its A derivative cancel. But the coefficient C produces non-vanishing terms. We have 



_d_ 



C 



AT* 



hp' 

1^2 



+ C)N7rV^(t) + (2 - 2C)N''7r<j),i + (1 - C)iV7r'V; 



(7.12) 



Although most terms in ( [7.12| ) vanish weakly, the last three, all proportional to 1 — C, do not. They generate a 
secondary constraint. This forms a second-class constraint set with the Hamiltonian and so generates yet another 
constraint. In fact, the Hamiltonian constraint will not propagate unless C = I. But this means that (j) is forced to 
respect the metric light cone. 

Moreover, the mechanism that enforces this will generate the universal metric-matter light cone - for bosons at 
least, we have not yet considered fermions. The mechanism has several hinges, but the linkage is unbreakable. The 
key is the presence of the scalar curvature R in the square-root constraint. The A derivative of this constraint therefore 
contains dR/dX. Now purely by kinematics 



OR 



dgi. 



dX 



^ dX 



R'^ 



dX 



2N 

7s' 



(7J3) 



where we use Eq.(7.4) to replace the time derivative of g with the momentum. The final expression in Eq.(7.13) will 
contain terms with p^.f . We now subtract ircj)'^ to generate the momentum constraint and obtain the first two terms 
on the right hand side of Eq.(7.12). This is the first hinge. The extra inp'^ terms now appear as the C-independent 
parts of the last three terms in Eq.( 7.12 ). The tt^-* terms with the factor C arise from two different sources. The 
(V0)^ ter m i n the Hamiltonian constraint has an explicit C in front of it. The time derivative of this term, combined 



with Eq.(7^) gives the term in the middle of the three. The other two terms arise from the time d erivat ive of the 
TT^ in the square root constraint. To evaluate this we use the Euler-Lagrange equation for (</), tt), Eq.(7.10); the first 
term in this has a C in it. This gives the first and third terms. This is the second hinge. 

Therefore, the constraint-propagation condition contains identical terms with and without the coefficient. This is 
the final hinge in the light-cone generating mechanism, for it makes it inevitable that constraint propagation will 
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enforce C = 1. The universality of the mechanism is ensured by the universal nature of the momentum constraint: 
any field of whatever tensor rank whose velocity appears in the Lagrangian with best-matched correction will be 
represented as a source term in the momentum constraint in a form solely determined by its Lie derivative. We shall 
see in the next section that this mechanism has even more implications. 

For the scalar field, we have thus derived the correct light-cone behaviour of Lorcntz-invariant field theory and can 
see that this will hold uniformly. We believe that this is a new result. We can also show that a derivative coupling 
term in (7.2) cannot be included consistently in the potential of our Lagrangian. 

It turns out that the fact that the 'mass' term in the scalar field can be chosen quite arbitrarily, as we have shown 
above, is part of a wider freedom. It turns out that a large class of what might be called 'dilatonic' theories can be 
written selfconsistently in BSW form. Let us consider a theory which has three constituents. One is the gravitational 
field, represented by a three-metric, gij; one is a massless scalar field, (f> (the dilaton); and the third is a 'massive' 



scalar field, x- The key point is that we assume that the 'mass' of x is some arbitrary function of 



We further assume that both scalar fields are minimally coupled to the metric and that the kinetic energy takes the 
simplest possible form. 

This means that we assume the potential term is 



1 



1 



P = R- ^ff''V,a<^,6 - ^ {g^^aXfi - fWX^) 



and the kinetic term is 



dgab 
dX 



dg. 



cd 



cd 



1 2 



dx ^''^ 



The square-root identity becomes 

gH = -p'^p,j + _ ^2 „ ^2 ^ +U^ + U^) = 0, 
and variation w.r.t. ^ gives the momentum constraint 



(7.14) 



(7.15) 



(7.16) 



(7.17) 



We need to consider the propogation of the square-root constraint. We know that most of the terms will take care of 
themselves and we really need only focus on the derivative of /(</»). One place is in the Euler-Lagrange equation for 



~dX 



This will feed into 



The other place is in the time derivative of where we get 



dX 



dX 



V9 



n<p)x' 



(7.18) 



(7.19) 



(7.20) 



where we used Eq.( |7.6| ) to replace the time derivative of (p with its momentum. The two terms obviously cancel. It is 
clear that we can replace the mass term in x with a polynomial and have each coefficient be a different function of 0. 

Possibly the best known dilaton theory is Brans-Dicke theory . It is interesting to see how this can be written in 
BSW form. Let us start with the simplest possible local square root form with just gravity an d a m inimal ly co upled 
massless scalar field. This means that we choose the potential and kinetic terms as in Eqns. (7.14) and (7.1?;) with 
X = 0. We now perform a 'point' transformation and change variables to (7, $) via: 



gab = $7ab, 



-V4:LU + 61og$. 



(7.21) 



The action changes to 
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ac^bd 



dX 



m)ai 



dX 



cd 



±^cd 



dX 



cd 



I dX 



4ui fd± 

*2 IdX 



$;.e'] , (7.22) 



where = ^* and = $ ^^i. We vary this w.r.t. d^cd/dX and d^/dX to get the momenta conjugate to -fed and $. 
These are 



^cd 




(7 7 -7 7) 



dX 



(7.23) 



and 



(9$ 

dX -'^ 



dX 



(KOcd 



(7.24) 



Then the square-root constraint simply becomes the standard 3+1 Brans-Dicke Hamiltonian constraint and simi- 
larly for the momentum constraint. The square-root identity becomes 



1 



(4w + 6) 



{p - $7r<i>)^ + ^"^R - uj-f'^di^dj^ - 2$V2$ = 0, 



and variation w.r.t. ^ gives the momentum constraint 



7;2 



iyr^^'-' = 0. 



(7.25) 



(7.26) 



This constraint algebra will close because it is just a transformed version of a minimally coupled scalar field. 

There is thus a one-to-one correspondence between solutions of the Einstein equations with a minimally coupled 
massless scalar field and the solutions of the 'vacuum' Brans-Dicke equations. The true difference between the two 
theories is whether the Einstein metric g or the Brans-Dicke metric 7 determines the geodesies. One way to test this 
is to add a second minimally-coupled scalar field. 



Therefore, consider the Brans-Dicke action (7.22) and minimally couple a second, constant mass, scalar field x to 



it. To do this, we add AP — — <&^|(7*''x,iX,i — m'^x^) to the potential term and add AT = [dx/dX — X;i€]'^ to the 
kinetic term. Rather than computing in the B rans- Dicke frame it is easier to see what is happening in the Einstein 
frame. Let us now undo the transformation ( 7.21 ). The Brans-Dicke field translates into Einstein gravity with a 
minimally-coupled scalar field. The kinetic term is of the standard form, 



bd 



dX 



(KOat 



dg. 



cd 



dX 



cd 



+ 



dX 



dx 
dX 



(7.27) 



the additional potential term becomes 



ab 

9 X,aX.b 



(7.28) 



This is a special case of the dilaton theories we discussed earlier. Therefore the constraint algebra will close both in 
the Einstein frame and in the Brans-Dicke frame. Therefore Brans-Dicke fits very naturally into the BSW framework. 

The conclusion of this section must be that, when we consider scalar fields coupled to gravity and insist on a local 
square root action, we recover causality. The characteristic speeds of the scalar fields must coincide with that of 
gravity. Other than that, there is a large freedom in the detailed form of the scalar fields and in the way they interact. 
In the next section we deal with vector fields. There we discover that the detailed form of the vector field is rigidly 
prescribed. The only theory that fits our framework is massless electrodynamics, we recover the Maxwell equations 
unambiguously. 



VIII. THREE- VECTOR FIELD INTERACTING WITH GRAVITY 



Since the kinetic and potential terms of different fields are simply added separately to the potential and T and do 
not mix unless an interaction between them is introduced explicitly, we can treat different fields (scalar, 3-vector, 
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3-spinor) separately. We now consider a 3- vector field Aa- We use the covariant Aa as the independent object, 
matching our use of the covariant metric gat and covariant shift ^a- 

The correction to the 3- vector velocity induced by the difFeomorphism-gcncrating field ^ to implement best matching 
is as unambiguous as it is for the scalar field. It is the Lie derivative of Aa with respect to ^: 



Therefore we add a term 



dAg 
dX 



^ \ dx 



r,bC 



dAg 

dX 



to the metric kinetic energy. 

The additions Ua to the potential are equally obvious: 

Ua = Ca„;6^«'^ + C^Aa-fiA^-^- + CsA^A]^ + Bk{A-A^f 

= (Cig'^^g"'^ + C-,g'"^g'"' + C^g'^'g'"^) A^^cA^.a + Efe By,{g'''' A^A^f . 

Hence we begin with the modified BSW action (which we call A a) 



and vary it to get 



This can be inverted to give 



Aa = j dxj + UA^Tg + TAd^x, 

dgcd 



' 9{R + Ua) 
T, + Ta 



[g'^g'" - g'^g'"') 



dx 



2N / 

71 \ 



dX 



9ijP 



cd 



where we define 2N = sjTg + Ta/R + Ua- The momentum conjugate to Aa is 

'dAi, 



TT = 



5C 



g{R+UA)^ab 



Tg + TA 



dX 



-C^Ai 



This also can be inverted to give 



The square-root identity becomes 



dAi 2N7T, 



gH = -p'^pij + - 7r«7r„ + g{R + Ua) = 0, 



and variation w.r.t. ^ gives the momentum constraint 



p- 



0. 



The Eulcr Lagrange evolution equations for gij and Aa are 

dp'^ 5C 



dX 5gi 



- {R'^ - g'm) - ^ {p'^p^ - \pp'') + Vg {N''^ - g'^V^N) 
[Ci {A^'^A^.a + A^'^A^i) + {A^'^A^^ + ^"-M^'.J + Cs^^ [A''^^ + A^-')] 
+ [(7i^„;6^«;'' + C2Aa;bA^''' + CgAf.AfJ g'^ - 



(8.1) 
(8.2) 



(8.3) 
(8.4) 

(8.5) 

(8.6) 

(8.7) 
(8.8) 

(8.9) 
;8.10) 



+ Vg[Ci+ C2] [NA'-^A^y'" + 7VA('Al'=l'J) - NA^A'-^'^q,^ + 



dg- 



^NY.^kBk{A'^Aaf-^A'A^ + ^NY.^Bk{A<'Aafg'^+C^.p'- 



(8.11) 



ax 



6C 
JAi 



= -2^5 



+ 2^NJ2kBk{A''Aar-^A'+C^7r\ (8.12) 
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We now check for propagation of the constraints ( |8.9| ) and (3.10) under the evolution by ( ^.6[ ), ( |8.8| ), ( ^.11 ) and 
( ^.12 ). As expected, the momentum constraint propagates. In the Hamiltonian constraint, the simple self-interaction 
terms (with coefHcients Bk) with no derivatives of either Aa or gij give no problem at this stage. However, dH/dX 
contains terms that are not proportional to the constraints. The full expression is 



+ \Np V/g {R + Ua)-^ {p^'Pab - Ip" + 7r"7r,) 



C 



N 



4C2-1 

AT 



{2[7VM(,;,)A/(/^ - yS'^)]--'' - [N^A^,,,)Afip'"^ - yg'^^j} 
2^{2[N^A-,gMAfip'^. - ipjp]^'^ - [N^ A-,gMAf (p'"' - '^pg'%f}. 



,Cl±C2 

' N 



.13) 



The last six terms do not vanish weakly. They have to vanish, otherwise we get a family of secondary constraints 
which eliminates all the degrees of freedom. We can (and must!) eliminate most of them by choosing Ci = — C2 = 
— 1/4, C3 = (which means that Ua = — (curM)^/4). This still leaves us with tt^.i^ = as a new, secondary, constraint 
which we cannot eliminate. This is none other than the Gauss constraint. 
Propagation of this new constraint requires 



d 



-2V5 [Nj2kBk{A'^Aa)''-^A' 



.14) 



The only way to ensure propagation is to set Bk = V fc. Therefore, the previously unproblematic undifferentiated 
potential terms are incompatible with the extra constraint. In our scheme, a scalar field can have mass (or power-law 
nonlinear coupling, say (f)^) but a 3- vector field cannot. Only the Maxwell 3- vector field minimally coupled to gravity 
and locked to its light cone survives. 

Since only curl A appears in the potential, addition of a time-independent scalar to Aa, Aa Aa + 9a A, does not 
change the action. This is the analogue of a time-independent 3-coordi nate transformation. If, more appropriately, 
we take a time-dependent A, we need to add the extra term —da^ to (BJ). The role of $ is to absorb the dA/dX 
that arises from the dA/dX term. The ^ terms play just the same role in compensating time-dependent coordinate 
transformations. 

It is now is easy to build the divergence constraint ■K^.^^ = into the action. We simply regard $ as an independent 
variable and vary w.r.t. it, just as with ^. Thus we regard Aa as a gauge field, so that tt^.^^ = becomes the Gauss 
constraint. The full correction to (8.1) is now 

dAa dAa 



dX 



dX 



- - L^Aa = -Q^- g'^Aa^bL 



(8.15) 



and we minimize w.r.t. $ in exactly the same way as w.r.t. ^. This 'gauge best matching' yields the Gauss constraint. 
Thus, the only way we have found to propagate the constraints of a metric-vector BSW-type action is to take 



Ua^I (^"^'A;a - A''-'Aa;b) = ~\ (curU)' 



= Tg + TA = (g g -g g ) 



ah „cd\ 



dgab 
dX 



- {KOab 



dg. 



cd 



+g 



ad 



dAa 



g'^Aa.bL - g'^^b.aA 



bcf 



ax 



ax 



- mu 

aAd 



g'^Ad.bL - g'%;dA 



bci 



dX 



(8.16) 



which is simply Einstein-Maxwell. There is, however, a te chnical point to note here. The Lie-corrected velocity that 
appears in the square parentheses in the final line of (S.16) does not appear in the form generally encountered in the 
literature. However, we can rewrite 



dAa 
dX 



- g'^Aa.b^c 



g'^^b-^aAc 



dAa 

ax 

dAa 

ax 



g''A,,a^, - .g''=A;aCc " g'^^b-.aA, - 

aAa 



- g'^Aa.bi 

-5''Aa;b]Cc-[r^c];a-<i>,a = 



ax 



g A[a,b]^, - ^^a- 



.17) 
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Here, we have replaced the gauge variable $ by a new scalar variable = $ + This means that in (^.17) we 



now have a term linear in the undifferentiated shift which is just the familiar minimally-coupled Maxwell field tensor. 
The residual scalar part that results from this manipulation gets absorbed in the original gauge variable <I>, which is the 
reason why we denote this new composite scalar by 5'. Two points are worth mentioning here: 1) The Lie derivative 
of a 3-vector field becomes ambiguous if the field is gauged, as is reflected in the heterogeneous gauge-diffeomorphism 
nature of 5*; 2) we are obtaining highly sophisticated theories - Einstcinian gravity and Maxwell minimally coupled 
to gravity ~ essentially uniquely out of the 3-space approach but always only 'by the skin of our teeth'. 

It is illuminating to consider why a massive vector field, which is a perfectly good generally covariant theory, is not 
allowed in the 3-space approach. As Giulini pointed out to us j2l], the ADM decomposition for the massive vector 
field is quite different from the massless case. While the momentum can be written as 



N 



dt 



(8.18) 



which looks identical to the last term in Eq.( S.17 ), there is a major difference in that, while is a true gauge degree of 
freedom and can be chosen arbitrarily, Aq in the massive vector field case is fixed. There exists a primary constraint 
that the momentum conjugate to Aq, tt^ , vanishes. The conservation of this constraint leads to 



4 + [Ao - N^Aa] = 0. (8.19) 



This is quite different from the massless case, for which, of course, m = 0, so that the second term is zero and we 
get a constraint that is homogeneous (and linear) in the canonical momenta and contains neither Aq nor the shift. 
It is this restriction to the canonical momenta that limits the physical degrees of freedom and indicates that the 
action is defined on curves in the space that is the product of superspace and the transverse degrees of freedom of the 
vector field. This matches our ideal of a geodesic- type theory. In the massive case, the restriction is lifted since the 
part homogeneous in the canonical momenta is now equal to a term containing what were previously purely gauge 
variables. In whatever way one attempts to interpret this extra term, it is clear that the massive vector field does not 
belong to the class of theories in which best matching with a local square root is applied to bona fide three-dimensional 
geometrical objects. 

Thus, we find that a non-gauge vector field cannot be coupled in any simple manner to the BSW action. For the 
reasons to be explained in the conclusions, we think it would be premature to attempt a rigorous no-go theorem, but we 
feel the provisional result is already remarkable and even hints at a partial unification of gravity and electromagnetism. 
The fact is that in pure geometrodynamics the BSW action is essentially unique, and we have found only one vector 
field that couples to it: Maxwell. Since our approach exploits 3-geometry to the maximal extent possible but nothing 
else, we can say that Maxwellian theory is uniquely inherent in Riemannian 3-geometries. We have already noted 
that, within our scheme, the attempt to couple a single 3-vector field to scalar fields leads to the standard U(l) gauge 



coupling 1 12 1, and the attempt to let a collection of 3-vector fields interact among themselves leads to Yang-Mills 
gauge theory Q . 

Our result also gives further insight into the origin of Lorentz invariance. We begin with the field Aa, which is as 
unashamedly three dimensional as the 3-metric. How does the full panoply of the 4-potential A^^v — 1,2,3,4, and 
the electromagnetic field tensor F^j^ arise? The answer is that the extra ('time') elements arise from the combined 
effect of the Lie-derivative best-matching correction to the 'bare' 3- velocity dAa/dX and from having to propagate 
the new square-root constraint. Best matching and the local square root do it all. It is particularly striking that 
the universal light cone and gauge theory arise from one and the same Lie-derivative correction mechanism. For the 
scalar field, the terms generated by the Lie correction in the momentum constraint fix one constant in the constraint- 
propagation condition and lock it to the gravity light cone. For the 3-vector field, they fix three coefficients, with the 
same effect, and impose the Gauss constraint. So, in a way, light cone and gauge are the same thing. Both derive 
from best-matched gravity. 

Some results of this section are only partly new, since Teitelboim showed that his postulates (see the end of 
Sec. VI) enforce gauge coupling. As in the case of the HKT result [||, we obtain his result with a significantly weaker 
assumption, and we also obtain the light cone and reveal its intimate connection to gauge theory. 

IX. CONCLUDING REMARKS 

Hitherto it has always seemed that four-dimensional general spacetime covariance is the very essence of GR. Many 
physicists have expressed strong reservations about the 3-t-l Hamiltonian formalism. It is held to be against the spirit 
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of general covariance and incapable of encompassing the wide range of topologies allowed by GR. The restriction to 
globally hyperbolic spacetimes - a necessary condition for the Hamiltonian treatment - is often severely criticized. 

We believe that the present work, if it can be successfully extended to the fermionic sector and withstands critical 
peer review, puts these issues in a different light. The fact is that, in a choice between two different theoretical 
schemes, there must always be a preference (in the absence of compelling experimental evidence) for the one that 
is more restrictive and, hence, makes stronger predictions. There are two respects in which the 3-space approach 
gives tighter predictions than the Einstein-Minkowski approach: it rules out many fields (the massive vector field 
for example) and, within the possibilities that remain, rules out many solutions, e.g., solutions with closed timelike 
curves, which are not globally hyperbolic. These are potential benefits. 

Here we should also mention the possibility of advancing to genuinely new physics. Our present paper does not 
actually predict any new theory. It merely slims down a class of theories long known to exist (all generally covariant 
theories) and, within the restricted class, rules out exotic scenarios (time travel for example). However, given that 
the treatment of fermions in curved space is so delicate, we do not rule out the possibility that the extension to the 
fermionic sector (if it succeeds) will bring further insights. There is also the possibility ||], already mentioned in 
the paper, of extension of the idea of best matching to conformal superspace (CS). As yet, we have performed this 
extension only for the matter-free case, but the results so far obtained are promising. They suggest the existence of 
a theory virtually identical to GR except for the elimination of scale as a dynamical variable. 

If a full theory can be developed on CS, the implications are far reaching. In the truly scale-invariant theory that 
will result, the Hubble red shift cannot be explained by the 'stretching of space', as it is at present. For the theory 
on CS is designed precisely to ehminate such 'stretching' as a degree of freedom. According to the present standard 
model, the universe is both expanding and simultaneously changing its 'shape', i.e., becoming more 'clumpy'. The 
expansion is used to explain the Hubble red shift. In a scale-invariant theory, only change of shape is physically 
meaningful, and it must explain the Hubble law though a gravitational red shift induced by 'clumping' p2|). 
Thus, the 3-space approach does have the potential to lead to very different cosmological predictions. 

Finally, it is worth mentioning that the local square root creates a theory that seems to be maximally sensitive 
t o a ll properties of Riemannian 3-geometries, as ca n be seen by comparing the global form (3/7) with the local form 
(3.6). The product of two global integrals in (3.7) cannot be as sensitive as the local form. Indeed, it seems to us 
that the twin principles of best matching and the local square root may implement the Cartesian ideal of explaining 
all dynamics by geometry. 
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